Creating a Spontaneous Conversational Speech Corpus
نویسندگان
چکیده
منابع مشابه
Creating a Spontaneous Conversational Speech Corpus
Speech recognition and language analysis of spontaneous speech arising in naturally spoken conversations are becoming the subject of much research. However, there is a shortage of spontaneous speech corpora that are freely available for academics. We therefore undertook the building of a natural conversation speech database, recording over 200 hours of conversations in English by over 600 local...
متن کاملCreating a speech corpus with semi-spontaneous, parallel conversational and clear speech Tech Report: CSLU-11-003
Our goal is to collect a speech corpus for the purpose of studying intelligibility and acoustic differences between the conversational and clear speech styles. The ideal corpus has the following properties: (1) speech has been produced spontaneously as part of a communicative interaction, as opposed to having been read to an imagined interlocutor; (2) entire identical utterances, or large parts...
متن کاملSpontaneous Speech Corpus of Japanese
Design issues of a spontaneous speech corpus is described. The corpus under compilation will contain 800-1000 hour spontaneously uttered Common Japanese speech and the morphologically annotated transcriptions. Also, segmental and intonation labeling will be provided for a subset of the corpus. The primary application domain of the corpus is speech recognition of spontaneous speech, but we plan ...
متن کاملDevelopments in Corpus-Based Speech Synthesis: Approaching Natural Conversational Speech
This paper describes the special demands of conversational speech in the context of corpus-based speech synthesis. The author proposed the CHATR system of prosody-based unit-selection for concatenative waveform synthesis seven years ago, and now extends this work to incorporate the results of an analysis of five-years of recordings of spontaneous conversational speeech in a wide range of actual...
متن کاملJANUS-II-translation of spontaneous conversational speech
JANUS-II is a research system to design and test components of speech-to-speech translation systems as well as a research prototype for such a system. We will focus on two aspects of the system: 1) new features of the speech recognition component JANUS-SR, 2) the end-to-end performance of JANUS-II, including a comparison of two machine translation strategies used for JANUS-MT (PHOENIX and GLR*).
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Data Science Journal
سال: 2012
ISSN: 1683-1470
DOI: 10.2481/dsj.10-011